Abstract

We consider the task of computing an approximate minimizer of the sum of
a smooth and a non-smooth convex functional in Banach space. Motivated by
the classical forward-backward splitting method for the subgradients in
Hilbert space, we propose a generalization which involves the iterative
solution of simpler subproblems. Descent and convergence properties of this
new algorithm are studied. Furthermore, the results are applied to the
minimization of Tikhonov-functionals associated with linear inverse
problems and semi-norm penalization in Banach spaces. With the help of
Bregman-Taylor-distance estimates, rates of convergence for the
forward-backward splitting procedure are obtained. Examples which
demonstrate the applicability are given; in particular, a generalization of
the iterative soft-thresholding method by Daubechies, Defrise and De Mol to
Banach spaces as well as total-variation based image restoration in higher
dimensions are presented.

Keywords: forward-backward splitting, convergence analysis, Banach
spaces, inverse problems, iterative thresholding

1 Introduction

The purpose of this paper is, on the one hand, to introduce an iterative 
forward-backward splitting procedure for the minimization of functionals of 
type 

\[ \min_{u \in X} \; F(u) + \Phi(u) \tag{1} \]

in Banach spaces and to analyze its convergence properties. Here, $F$
represents a convex smooth functional while the convex $\Phi$ is allowed to
be non-smooth. On the other hand, the application of this algorithm to
Tikhonov functionals associated with linear inverse problems in Banach
space is studied. We consider, in particular, general regularization terms
which are only powers of semi-norms instead of norms. Moreover, examples
which show the range of applicability of the algorithm as a computational
method are presented.

The forward-backward splitting algorithm for minimization in Banach
space suggested in this work tries to establish a bridge between the
well-known forward-backward splitting in Hilbert space [2], [11], [13] and
minimization algorithms in Banach space, which in general require more
analysis. For example, in the situation of Banach spaces, gradient-based
(or subdifferential-based) methods always have to deal with the problem
that the gradient is an element of the dual space and can therefore not be
used directly as a descent direction. One common approach to circumvent
this difficulty is to perform the step in gradient direction in the dual
space and to use appropriate duality mappings to link this procedure to the
primal space P, 123] . Such a procedure applied to (1) can be seen as a
full explicit step for $\partial(F + \Phi)$, and convergence can often be
achieved with the help of a-priori step-size assumptions. In contrast to
this, forward-backward splitting algorithms also involve an implicit step
by applying a resolvent mapping. The notion of resolvents can also be
generalized to Banach spaces by introducing duality mappings [21].
Combining both explicit and implicit steps into a forward-backward
splitting algorithm, however, has not been considered so far.

The paper can be outlined as follows. In Section 2, we present a
generalization of the forward-backward splitting algorithm which operates
in Banach spaces and coincides with the usual method in case of Hilbert
spaces. The central step for the proposed method is the successive solution
of problems of type

\[ \min_{v \in X} \; \frac{\|v - u^n\|_X^p}{p} + \tau_n \bigl( \langle F'(u^n), v \rangle + \Phi(v) \bigr) \]

which are in general easier to solve than the original problem. In
particular, we will show in Section 3 that the algorithm stops or the
functional values converge as $n \to \infty$ with an asymptotic rate of
$n^{1-p}$ if $X$ is reflexive and $F'$ is locally $(p-1)$-Hölder continuous.
There, we only have to assume that the step-sizes obey some upper and lower
bounds. Moreover, under certain conditions, strong convergence with
convergence rates will be proven, in particular that the $q$-convexity of
$\Phi$ implies the rate $n^{(1-p)/q}$.

In Section 4 we will apply these results to the linear inverse problem of
solving $Ku = f$ with $K : X \to Y$ which is regularized with a semi-norm of
some potentially smaller space as penalty term, leading to the problem
of minimizing a non-smooth Tikhonov-functional:

\[ \min_{u \in X} \; \frac{\|Ku - f\|_Y^r}{r} + \alpha \frac{|u|_Z^s}{s} \]

The forward-backward splitting algorithm applied to this type of functional
will be discussed. It turns out that we have convergence with rate
$n^{(1-p)/q}$ whenever the data space $Y$ is $p$-smooth and $q$-convex and
the regularizing semi-norm is $q$-convex in a certain sense.

In Section 5, examples are given on how to compute the algorithm. Also,
basic numerical calculations are shown. We consider linear inverse problems
with sparsity constraints in Banach spaces, which leads to a generalization
of the results for the iterative soft-thresholding procedure in [IJ] to
Banach spaces. Moreover, it is also discussed how to apply the algorithm to
linear inverse problems with total-variation regularization in higher
dimensions where an embedding into the Hilbert space $L^2(\Omega)$ is
impossible. The article finally concludes with some remarks in Section 6.

2 A forward-backward splitting algorithm in Banach space

Let $X$ be a reflexive Banach space in which the functional (1) has to be
minimized. Assume that both $F$ and $\Phi$ are proper, convex and lower
semi-continuous functionals such that $F + \Phi$ is coercive. In this
setting, we require that $F$ represents the "smooth part" of the functional
whereas $\Phi$ is allowed to be non-smooth. Specifically, it is assumed that
$F$ is differentiable with a derivative which is, on each bounded set,
$(p-1)$ Hölder-continuous for some $1 < p \le 2$, i.e.

\[ \|F'(u) - F'(v)\|_{X^*} \le \|F'\|_{p-1} \|u - v\|_X^{p-1} \]

for $\|u\|_X, \|v\|_X \le C$.

We propose the following iterative procedure in order to find a solution
of (1).

1. Start with a $u^0 \in X$ with $\Phi(u^0) < \infty$ and $n = 0$. Estimate,
using the coercivity of $F + \Phi$, a norm bound $\|u\|_X \le C$ for all
$u$ with $(F + \Phi)(u) \le (F + \Phi)(u^0)$ and choose $\|F'\|_{p-1}$
accordingly.

2. Compute the next iterate $u^{n+1}$ as follows. Compute $w^n = F'(u^n)$
and determine, for a $\tau_n > 0$ satisfying

\[ 0 < \underline{\tau} \le \tau_n , \qquad \tau_n \le \frac{p(1 - \delta)}{\|F'\|_{p-1}} , \tag{2} \]

(with $0 < \delta < 1$) the solutions of the auxiliary minimization problem

\[ \min_{v \in X} \; \frac{\|v - u^n\|_X^p}{p} + \tau_n \bigl( \langle w^n, v \rangle + \Phi(v) \bigr) . \tag{3} \]

Choose, if necessary, $u^{n+1}$ as a solution of (3) which minimizes
$\|v - u^n\|_X$ among all solutions $v$ of (3) to ensure that the fixed
points of $u^n \mapsto u^{n+1}$ are exactly the minimizers of (1).

3. If $u^{n+1}$ is not optimal, continue with $n := n + 1$ and repeat with
Step 2, after optionally adjusting $\|F'\|_{p-1}$ analogously to Step 1.
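
The three steps above can be sketched in code for the simplest special case.
The following is a minimal illustration only, assuming $X = \mathbb{R}^m$
with the Euclidean norm (so $p = 2$ and the duality map is the identity),
$F(u) = \|Ku - f\|^2/2$ and $\Phi(u) = \alpha \|u\|_1$, for which the
auxiliary problem (3) is solved by componentwise soft-thresholding. The
names `forward_backward`, `soft_threshold`, `objective` and the small data
set are hypothetical and not taken from the paper.

```python
def soft_threshold(x, t):
    """Componentwise minimizer of (v - x)^2 / 2 + t * |v|."""
    return [max(abs(xi) - t, 0.0) * (1.0 if xi > 0 else -1.0) for xi in x]


def mat_vec(A, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def objective(K, f, alpha, u):
    """(F + Phi)(u) = |K u - f|^2 / 2 + alpha * |u|_1 (assumed model)."""
    residual = [r - fi for r, fi in zip(mat_vec(K, u), f)]
    return 0.5 * sum(r * r for r in residual) + alpha * sum(abs(x) for x in u)


def forward_backward(K, f, alpha, tau, steps):
    """Iterate u <- argmin_v |v - u|^2 / 2 + tau * (<w, v> + alpha * |v|_1)
    with w = F'(u) = K^T (K u - f); returns the last iterate and the
    history of objective values."""
    K_adjoint = [list(col) for col in zip(*K)]
    u = [0.0] * len(K[0])
    history = [objective(K, f, alpha, u)]
    for _ in range(steps):
        residual = [r - fi for r, fi in zip(mat_vec(K, u), f)]
        w = mat_vec(K_adjoint, residual)                  # w^n = F'(u^n)
        shifted = [ui - tau * wi for ui, wi in zip(u, w)]
        u = soft_threshold(shifted, tau * alpha)          # solves (3) for p = 2
        history.append(objective(K, f, alpha, u))
    return u, history


# Hypothetical data. Here |K|^2 = 1.75 is the Lipschitz constant of F', so
# with p = 2 and delta = 0.2 the rule (2) allows tau <= 1.6 / 1.75.
K = [[1.0, 0.5, 0.0], [0.0, 1.0, 0.5]]
f = [1.0, -0.5]
u, history = forward_backward(K, f, alpha=0.1, tau=0.5, steps=200)
```

In this sketch the recorded objective values should be non-increasing, in
line with the descent property established in Section 3.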

Remark 1. This iterative procedure can be seen as a generalization of the
forward-backward splitting procedure for the subgradients (which is in turn
some kind of generalized gradient projection method [51]) in case of $X$ and
$Y$ being Hilbert spaces and $p = 2$. This reads as, in terms of resolvents
(see [17], la, [29], for example, for introductions to these notions),

\[ u^{n+1} = (I + \tau_n \partial\Phi)^{-1} (I - \tau_n F')(u^n) \]

or, equivalently,

\[ u^{n+1} = \operatorname*{argmin}_{v \in X} \; \frac{\|v - u^n + \tau_n F'(u^n)\|_X^2}{2} + \tau_n \Phi(v) . \tag{4} \]

Such an operation does not make sense in Banach spaces, since
$F'(u^n) \in X^*$ cannot be subtracted from $u^n \in X$. However, (4) is
equivalent to

\[ u^{n+1} = \operatorname*{argmin}_{v \in X} \; \frac{\|v - u^n\|_X^2}{2} + \tau_n \bigl( \langle F'(u^n), v \rangle + \Phi(v) \bigr) , \]

which only involves a duality pairing for $F'(u^n)$. Hence, one can replace
$\|v - u^n\|_X^2/2$ by $\|v - u^n\|_X^p/p$ and $X$, $Y$ with Banach spaces
and ends up with (3), which defines a sensible operation (see also
Proposition 1). Denoting $J_p = \partial \|\cdot\|_X^p/p$ and
$P_\tau(u, w) = (J_p(\cdot - u) + \tau \partial\Phi)^{-1}(w)$, one can
interpret $u^n \mapsto (u^n, -\tau_n F'(u^n))$ as being a generalized forward
step while $(u^n, -\tau_n F'(u^n)) \mapsto P_{\tau_n}(u^n, -\tau_n F'(u^n))$
represents the corresponding generalized backward step.
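
To make the equivalence claimed in Remark 1 explicit, the square in (4) can
be expanded. The following short derivation is a sketch under the
assumptions of the remark only ($X$ a Hilbert space identified with its
dual, $p = 2$):

```latex
\begin{aligned}
\frac{\|v - u^n + \tau_n F'(u^n)\|_X^2}{2} + \tau_n \Phi(v)
 &= \frac{\|v - u^n\|_X^2}{2} + \tau_n \langle F'(u^n), v - u^n \rangle
    + \frac{\tau_n^2 \|F'(u^n)\|_X^2}{2} + \tau_n \Phi(v) \\
 &= \frac{\|v - u^n\|_X^2}{2}
    + \tau_n \bigl( \langle F'(u^n), v \rangle + \Phi(v) \bigr)
    + \text{terms independent of } v .
\end{aligned}
```

Since the remaining terms do not depend on $v$, both functionals have the
same minimizers.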

In the following, we will see that the iteration is indeed well-defined,
i.e. one can always solve the auxiliary problem (3). Moreover, it is shown
that the corresponding fixed points are exactly the solutions of (1). We
start with proving the well-definedness.

Proposition 1. The problem

\[ \min_{v \in X} \; \frac{\|v - u\|_X^p}{p} + \tau \bigl( \langle w, v \rangle + \Phi(v) \bigr) \tag{5} \]

has a solution for each $u \in X$, $w \in X^*$ and $\tau \ge 0$. Moreover,
for $u = u^n$, there always exists a solution $u^{n+1}$ which minimizes
$\|v - u^n\|_X$ among all solutions $v$.



Proof. The case $\tau = 0$ is trivial, so let $\tau > 0$ in the following.
We first show the coercivity of the objective functional. For this purpose,
note that $N_p(v) = \|v - u\|_X^p/p$ grows faster than $\|v\|_X$, i.e.
$N_p(v)/\|v\|_X \to \infty$ whenever $\|v\|_X \to \infty$. Moreover, for a
$u^*$ minimizing (1) we have
$\Phi(v) \ge \Phi(u^*) + \langle w^*, v - u^* \rangle$ where
$w^* = -F'(u^*) \in \partial\Phi(u^*)$, since $F$ is continuous in $X$,
implying that $\partial(F + \Phi) = F' + \partial\Phi$, see [29]. Hence, we
can estimate

\[
\frac{\|v - u\|_X^p}{p} + \tau \bigl( \langle w, v \rangle + \Phi(v) \bigr)
 \ge \|v\|_X \Bigl( \frac{N_p(v)}{\|v\|_X} - \tau \|w + w^*\|_{X^*}
   + \tau \frac{\Phi(u^*) - \langle w^*, u^* \rangle}{\|v\|_X} \Bigr)
 \ge L \|v\|_X
\]

for some $L > 0$ and all $v$ whose norm is large enough, showing that the
functional in (5) is coercive.

It follows that the functional in (5) is proper, convex, lower
semi-continuous and coercive in a reflexive Banach space; consequently, at
least one solution exists. Finally, denote by $M$ the set of solutions of
(5), which is non-empty. Also, $M$ is convex and closed since the functional
in (5) is convex and lower semi-continuous, respectively, in the reflexive
Banach space $X$. Consequently, by standard arguments from calculus of
variations,

\[ \min_{v \in X} \; \|v - u^n\|_X + I_M(v) \]

admits a solution. Thus, $u^{n+1}$ is well-defined. □

Proposition 2. The solutions $u^*$ of the problem (1) are exactly the fixed
points of the iteration (for each $\tau_n > 0$).

Proof. Suppose that $u^* = u^n$ is optimal. Then, since
$\partial(F + \Phi) = F' + \partial\Phi$, we know that $w^* = F'(u^*)$ also
satisfies $-w^* \in \partial\Phi(u^*)$. Now, the solutions of (3) can be
characterized by each $v \in X$ solving the inclusion relation

\[ -\tau_n w^n \in J_p(v - u^n) + \tau_n \partial\Phi(v) \]

with $J_p$ being the $p$-duality relation in $X$, i.e.
$J_p = \partial \frac{1}{p} \|\cdot\|_X^p$. Obviously, $v = u^*$ is a
solution, and from the requirement that $u^{n+1}$ is the solution which
minimizes $\|v - u^n\|_X$ among all solutions it follows that
$u^{n+1} = u^n$, hence $u^*$ is a fixed point.

On the other hand, if $u^n = u^{n+1}$, then for the corresponding $w^n$ it
holds that

\[ -\tau_n w^n \in J_p(0) + \tau_n \partial\Phi(u^n) \quad \Rightarrow \quad -w^n \in \partial\Phi(u^n) \]

since $J_p(0) = \{0\}$ (by Asplund's theorem, see [13]), meaning that $u^n$
is optimal by $w^n = F'(u^n)$. □



These results justify in a way that the proposed iteration, which can be
interpreted as a fixed-point iteration, makes sense. It is, however, not
clear whether we can achieve convergence to a minimizer if the optimality
conditions are not satisfied. For this purpose, the algorithm has to be
examined more deeply. We will use the descent of the objective functional
in (1) in order to obtain conditions under which convergence holds.

3 Descent properties and convergence 

This section deals with descent properties for the proximal forward-backward
splitting algorithm. It is shown that each iteration leads to a sufficient
descent of the functional whenever (2) is satisfied. Since the minimization
problem (1) is convex, we can also obtain convergence rates for the
functional, which will be done in the following. The proof is essentially
based on four steps which are in part inspired by the argumentation in
[15], y, [5]. After introducing a functional $D(u^n)$ which measures the
descent of the objective functional, we prove a descent property which,
subsequently, leads to convergence rates for the distance of the functional
values to the minimum. Finally, under assumptions on the Bregman (and
Bregman-Taylor) distances, convergence rates for the distance to the
minimizer follow.

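Lemma 1. For each iterate and each $v \in X$, we have the inequality

\[ \frac{\langle J_p(u^n - u^{n+1}), v - u^{n+1} \rangle}{\tau_n} \le \Phi(v) - \Phi(u^{n+1}) + \langle w^n, v - u^{n+1} \rangle . \tag{6} \]

Furthermore, with

\[ D(u^n) = \Phi(u^n) - \Phi(u^{n+1}) + \langle w^n, u^n - u^{n+1} \rangle \tag{7} \]

it holds that

\[ \frac{\|u^n - u^{n+1}\|_X^p}{\tau_n} \le D(u^n) . \tag{8} \]

Proof. Since $u^{n+1}$ solves (3) with data $w^n$ and $u^n$, the subgradient
relation

\[ -J_p(u^{n+1} - u^n) \in \tau_n \bigl( \partial\Phi(u^{n+1}) + w^n \bigr) \]

holds and, consequently, by the subgradient inequality,

\[ \Phi(u^{n+1}) + \langle w^n, u^{n+1} \rangle - \tau_n^{-1} \langle J_p(u^{n+1} - u^n), v - u^{n+1} \rangle \le \Phi(v) + \langle w^n, v \rangle \]

for each $v \in X$. Rearranging terms and noting that $J_p(-u) = -J_p(u)$
for each $u \in X$ yields (6). The inequality (8) then follows by letting
$v = u^n$ and noting that
$\langle J_p(u^n - u^{n+1}), u^n - u^{n+1} \rangle = \|u^n - u^{n+1}\|_X^p$
by definition of the duality relation. □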




Next, we prove a descent property which will be crucial for the convergence
analysis. It will make use of the "descent measure" $D(u^n)$ introduced
in (7).

Proposition 3. The iteration satisfies

\[ (F + \Phi)(u^{n+1}) \le (F + \Phi)(u^n) - \Bigl( 1 - \tau_n \frac{\|F'\|_{p-1}}{p} \Bigr) D(u^n) \tag{9} \]

with $D(u^n) \ge 0$ defined by (7).

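Proof. First note that from (8) it follows that $D(u^n) \ge 0$ and, together
with Proposition 2, that $D(u^n) = 0$ if and only if $u^n$ is optimal. Using
the definition of $D(u^n)$ gives

\[ (F + \Phi)(u^{n+1}) - (F + \Phi)(u^n) = -D(u^n) + F(u^{n+1}) - F(u^n) - \langle w^n, u^{n+1} - u^n \rangle . \tag{10} \]

Note that we can write

\[ F(u^{n+1}) - F(u^n) - \langle w^n, u^{n+1} - u^n \rangle = \int_0^1 \langle w(t) - w^n, u^{n+1} - u^n \rangle \, dt \tag{11} \]

with $w(t) = F'(u^n + t(u^{n+1} - u^n))$. We now want to estimate the
absolute value of (11) in terms of $D(u^n)$:

\[
\begin{aligned}
\Bigl| \int_0^1 \langle w(t) - w^n, u^{n+1} - u^n \rangle \, dt \Bigr|
 &\le \int_0^1 \|w(t) - w^n\|_{X^*} \|u^{n+1} - u^n\|_X \, dt \\
 &\le \int_0^1 \|F'\|_{p-1} \|t(u^{n+1} - u^n)\|_X^{p-1} \|u^{n+1} - u^n\|_X \, dt \\
 &= \frac{\|F'\|_{p-1}}{p} \|u^{n+1} - u^n\|_X^p
  \le \tau_n \frac{\|F'\|_{p-1}}{p} D(u^n)
\end{aligned}
\]

by employing the Hölder-continuity assumption as well as the estimate (8).
The claimed statement finally follows from the combination of this with
(10). □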

Note that Proposition 3 together with (2) yields a guaranteed descent
of the functional $F + \Phi$. Since in this case the boundedness
$0 < \underline{\tau} \le \tau_n$ is also given, the convergence of the
functional values to the minimum is immediate, as we will see in the
following lemma. But first, introduce the functional distance

\[ r_n = (F + \Phi)(u^n) - \Bigl( \min_{u \in X} \; (F + \Phi)(u) \Bigr) \tag{12} \]

which allows us to write (9) as:

\[ r_n - r_{n+1} \ge \Bigl( 1 - \tau_n \frac{\|F'\|_{p-1}}{p} \Bigr) D(u^n) . \tag{13} \]



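Lemma 2. Assume that $F + \Phi$ is coercive or the sequence $\{u^n\}$ is
bounded. Then, the sequence $\{r_n\}$ according to (12) satisfies

\[ r_n - r_{n+1} \ge c_0 r_n^{p'} \]

for each $n$, with $p'$ the dual exponent, $1/p + 1/p' = 1$, and some
$c_0 > 0$.

Proof. First note that, according to Proposition 3, $\{r_n\}$ is a
non-increasing sequence. If $F + \Phi$ is coercive, this immediately means
$\|u^n - u^*\|_X \le C_1$ for some minimizer $u^* \in X$ and all $n$. The
same holds true if $\{u^n\}$ is already bounded.

Observe that the convexity of $F$ as well as (6) gives

\[
\begin{aligned}
r_n &\le \Phi(u^n) - \Phi(u^*) + \langle w^n, u^n - u^* \rangle \\
 &= D(u^n) + \Phi(u^{n+1}) - \Phi(u^*) + \langle w^n, u^{n+1} - u^* \rangle \\
 &\le D(u^n) + \tau_n^{-1} \langle J_p(u^n - u^{n+1}), u^{n+1} - u^* \rangle \\
 &\le D(u^n) + \tau_n^{-1} \|J_p(u^n - u^{n+1})\|_{X^*} \|u^{n+1} - u^*\|_X .
\end{aligned}
\]

Now, since $p > 1$,

\[ \|J_p(u^n - u^{n+1})\|_{X^*} = \|u^n - u^{n+1}\|_X^{p-1} = \bigl( \|u^n - u^{n+1}\|_X^p \bigr)^{1/p'} \]

where $p'$ is the dual exponent, i.e. $\frac{1}{p} + \frac{1}{p'} = 1$.
Further, applying (8), (13) and taking the step-size constraint (2) into
account yields

\[ \delta r_n \le r_n - r_{n+1} + C_1 (r_n - r_{n+1})^{1/p'} (\delta \underline{\tau}^{-1})^{1/p} . \]

Note that $r_n - r_{n+1} \le r_0$ since $\{r_n\}$ is non-increasing. This
finally gives

\[
\delta r_n \le \bigl( r_0^{1/p} + C_1 (\delta \underline{\tau}^{-1})^{1/p} \bigr) (r_n - r_{n+1})^{1/p'}
\quad \Rightarrow \quad
r_n - r_{n+1} \ge c_0 r_n^{p'} , \quad
c_0 = \frac{\delta \underline{\tau}^{p'-1}}{\bigl( (\delta^{-1} \underline{\tau} r_0)^{1/p} + C_1 \bigr)^{p'}} .
\]

□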

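Proposition 4. Under the prerequisites of Lemma 2, the functional values
for $\{u^n\}$ converge to the minimum with rate

\[ r_n \le C n^{1-p} \]

for some $C > 0$.

Proof. Apply the mean value theorem to get the identity

\[
\frac{1}{r_{n+1}^{p'-1}} - \frac{1}{r_n^{p'-1}}
 = \frac{r_n^{p'-1} - r_{n+1}^{p'-1}}{(r_{n+1} r_n)^{p'-1}}
 = \frac{(p' - 1) \vartheta^{p'-2} (r_n - r_{n+1})}{(r_{n+1} r_n)^{p'-1}}
\]

with $r_{n+1} \le \vartheta \le r_n$. Thus,
$\vartheta^{p'-2} \ge r_{n+1}^{p'-1} r_n^{-1}$ and, by Lemma 2,

\[ \frac{1}{r_{n+1}^{p'-1}} - \frac{1}{r_n^{p'-1}} \ge \frac{(p' - 1)(r_n - r_{n+1})}{r_n^{p'}} \ge (p' - 1) c_0 . \]

Summing up then yields

\[ \frac{1}{r_n^{p'-1}} - \frac{1}{r_0^{p'-1}} = \sum_{i=0}^{n-1} \Bigl( \frac{1}{r_{i+1}^{p'-1}} - \frac{1}{r_i^{p'-1}} \Bigr) \ge n (p' - 1) c_0 \]

and consequently,

\[ r_n^{p'-1} \le \bigl( r_0^{1-p'} + c_0 (p' - 1) n \bigr)^{-1} \quad \Rightarrow \quad r_n \le C n^{1-p} \]

since $1/(1 - p') = 1 - p$. □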
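
The rate in Proposition 4 can be checked numerically on the borderline case
of Lemma 2. The sketch below is illustrative only: it assumes equality
$r_n - r_{n+1} = c_0 r_n^{p'}$ and compares the resulting sequence with the
bound $(r_0^{1-p'} + c_0 (p'-1) n)^{1/(1-p')}$ from the proof. The function
name `borderline_decay` and the parameter values are hypothetical.

```python
def borderline_decay(r0, c0, p, steps):
    """Run r_{n+1} = r_n - c0 * r_n**p_dual (equality in Lemma 2) and return
    the sequence together with the bound derived in the proof."""
    p_dual = p / (p - 1.0)                  # dual exponent: 1/p + 1/p' = 1
    values, bounds = [r0], [r0]
    r = r0
    for n in range(1, steps + 1):
        r = r - c0 * r ** p_dual
        base = r0 ** (1.0 - p_dual) + c0 * (p_dual - 1.0) * n
        values.append(r)
        bounds.append(base ** (1.0 / (1.0 - p_dual)))
    return values, bounds


# Hypothetical parameters: p = 1.5 gives p' = 3 and the rate n^(1 - p) = n^(-1/2).
values, bounds = borderline_decay(r0=1.0, c0=0.1, p=1.5, steps=1000)
scaled = [v * n ** 0.5 for n, v in enumerate(values)]   # should stay bounded
```

For these parameters, every entry of `values` stays below the corresponding
entry of `bounds`, and `scaled` remains bounded, which is what the rate
$r_n \le C n^{1-p}$ expresses.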

This result immediately gives us weak subsequential convergence to some 
minimizer. 

Proposition 5. In the situation of Lemma 2, the sequence $\{u^n\}$ possesses
at least one weak accumulation point. Each weak accumulation point is a
solution of (1). In particular, if the minimizer $u^*$ is unique, then
$u^n \rightharpoonup u^*$.

Proof. Since $r_n \le C n^{1-p}$, the sequence is a minimizing sequence;
thus, due to the weak lower semi-continuity of $F + \Phi$, the limit of each
weakly convergent subsequence is a minimizer. Moreover, it also follows that
$\{u^n\}$ is a bounded sequence in the reflexive Banach space $X$, meaning
that there is a weakly convergent subsequence. The statement
$u^n \rightharpoonup u^*$ in case of uniqueness follows by the usual
subsequence argument. □

To establish strong convergence, one has to examine the functionals $F$
and $\Phi$ more closely. One approach is to consider the following
Bregman-like distance of $\Phi$ at a solution $u^*$ of (1):

\[ R(v) = \Phi(v) - \Phi(u^*) + \langle F'(u^*), v - u^* \rangle \tag{14} \]

which is non-negative since $-F'(u^*) \in \partial\Phi(u^*)$. Note that if
$\partial\Phi(u^*)$ consists of one point, $R$ is indeed the Bregman
distance. By optimality of $u^*$ and the subgradient inequality, we have for
the iterates $u^n$ that

\[ r_n \ge \Phi(u^n) - \Phi(u^*) + \langle F'(u^*), u^n - u^* \rangle = R(u^n) , \]

hence $R(u^n) \le C n^{1-p}$ by Proposition 4. Also note that $R(u) = 0$ for
optimal $u$. The usual way to achieve convergence is to postulate decay
behaviour for $R$ as the argument approaches $u^*$.



Definition 1. Let $\Phi : X \to \mathbb{R} \cup \{\infty\}$ be proper,
convex and lower semi-continuous. The functional $\Phi$ is called totally
convex in $u^* \in X$, if, for each $w \in \partial\Phi(u^*)$ and $\{u^n\}$
it holds that

\[ \Phi(u^n) - \Phi(u^*) - \langle w, u^n - u^* \rangle \to 0 \quad \Rightarrow \quad \|u^n - u^*\|_X \to 0 . \]

Likewise, $\Phi$ is convex of power-type $q$ (or $q$-convex) in $u^* \in X$
with a $q \in [2, \infty[$, if for all $M > 0$ and
$w \in \partial\Phi(u^*)$ there exists a $c > 0$ such that for all
$\|u - u^*\|_X \le M$ we have

\[ \Phi(u) - \Phi(u^*) - \langle w, u - u^* \rangle \ge c \|u - u^*\|_X^q . \]

The notion of total convexity of functionals is well-known in the literature
7, la], convexity of power type $q$ is also referred to as $q$-uniform
convexity 3|. The former term is, however, often used in conjunction with
norms of Banach spaces for which an equivalent definition in terms of the
modulus of convexity (or rotundity) is used [32].

Now, if $\Phi$ is totally convex in $u^*$, then the sequence $\{u^n\}$ also
converges strongly to the minimizer since $R(u^n) \to 0$. Additionally, the
minimizer has to be unique since $\|u^{**} - u^*\|_X > 0$ and
$R(u^{**}) = 0$ would violate the total convexity property. The latter
considerations prove:

Theorem 1. If $\Phi$ is totally convex, then $\{u^n\}$ converges to the
unique minimizer $u^*$ in the strong sense.

Remark 2. The notion of total convexity is well-known in the study of
convergence of numerical algorithms and can be established for a variety of
functionals of interest, for instance, for $\Phi(u) = \|u\|_Y^r$ if $Y$ is
(locally) uniformly convex and $r > 1$ [9]. If $Y$ is continuously embedded
in $X$, then $\|u^n - u^*\|_X \le C \|u^n - u^*\|_Y$, thus $R(u^n) \to 0$
implies $\|u^n - u^*\|_Y \to 0$ and consequently $\|u^n - u^*\|_X \to 0$.

As one can easily see, the notion of $q$-convexity of $\Phi$ in $u^*$ is
useful to obtain bounds for the speed of convergence: In addition to strong
convergence ($q$-convex implies totally convex), since $\{u^n\}$ is bounded,
one can choose a bounded neighborhood in which the sequence is contained and
obtains

\[ \|u^n - u^*\|_X \le \bigl( c^{-1} R(u^n) \bigr)^{1/q} \le (c^{-1} r_n)^{1/q} \le c^{-1/q} C^{1/q} n^{(1-p)/q} \]

for all $n$, meaning that $u^n \to u^*$ with asymptotic rate $n^{(1-p)/q}$.
So, we can note:

Theorem 2. If $\Phi$ is $q$-convex, then $\{u^n\}$ converges to the unique
minimizer $u^*$ with asymptotic rate $n^{(1-p)/q}$.

Remark 3. Again, a variety of functionals is $q$-convex, typically for
$q \ge 2$. The notion translates to norms as follows: If $Y$ is a Banach
space which is convex of power-type $q$ for $q > 1$ (see [23] for an
introduction to this notion) which is continuously embedded in $X$, then
$\Phi(u) = \|u\|_Y^q$ is $q$-convex. Consequently, one obtains convergence
with rate $n^{(1-p)/q}$.

Note that many known spaces are $q$-convex. For instance, if $r > 1$,
each $L^r(\Omega)$ is convex of power-type $\max\{2, r\}$ for arbitrary
measure spaces and the respective constants are known, see [19], [25]. The
analog applies to Sobolev spaces $H^{m,r}(\Omega)$ associated with arbitrary
domains $\Omega$: They are also convex of power-type $\max\{2, r\}$. In
fact, as a consequence of a theorem of Dvoretsky [16], any Banach space can
be at most convex of power-type 2.



While the Bregman-distance (or the Bregman-like distance $R$) turns out
to be successful in proving convergence for the minimization algorithm for
many functionals, there are some cases in which $\Phi$ fails to be
$q$-convex or totally convex. In these situations, one can also take the
Taylor-distance of $F$ into account, which is the remainder of the
Taylor-expansion of $F$ up to order 1:

\[ T(v) = F(v) - F(u^*) - \langle F'(u^*), v - u^* \rangle \tag{15} \]

Since $F$ is convex, $T(v) \ge 0$, hence one can indeed speak of some
distance (which is in general not symmetric and does not satisfy the
triangle inequality). In fact, the Taylor-distance is also a
Bregman-distance, but here we make the distinction in order to emphasize
that the Taylor- and Bregman-distances are the respective smooth and
non-smooth concepts for measuring distances by means of functionals.

With the introduction of $R$ and $T$, the distance of $(F + \Phi)(v)$ to
its minimum can be expressed as

\[ (F + \Phi)(v) - (F + \Phi)(u^*) = T(v) + R(v) , \tag{16} \]

which means that it can be split into a Bregman part (with respect to
$\Phi$) and a Taylor part (with respect to $F$). In some situations, this
splitting can be useful, especially for minimizing functionals of the
Tikhonov-type, as the following section shows.
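
For completeness, the splitting (16) can be verified directly from the
definitions (14) and (15). The following two-line computation is a sketch
using only those definitions; the pairing terms cancel:

```latex
\begin{aligned}
T(v) + R(v)
 &= F(v) - F(u^*) - \langle F'(u^*), v - u^* \rangle
  + \Phi(v) - \Phi(u^*) + \langle F'(u^*), v - u^* \rangle \\
 &= (F + \Phi)(v) - (F + \Phi)(u^*) .
\end{aligned}
```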

4 Application to Tikhonov functionals 

Consider the general problem of minimizing a typical Tikhonov-type
functional for a linear inverse problem,

\[ \min_{u \in X} \; \frac{\|Ku - f\|_Y^r}{r} + \alpha \frac{|u|_Z^s}{s} \tag{17} \]

where $r > 1$, $s \ge 1$, $\alpha > 0$, $X$ is a reflexive Banach space,
$Y$ a Banach space with $K : X \to Y$ continuously and some given data
$f \in Y$. Finally, let $|\cdot|_Z$ be a semi-norm of the (not necessarily
reflexive) Banach space $Z$ which is continuously embedded in $X$. It
becomes a minimization problem of type (1) with

\[ F(u) = \frac{\|Ku - f\|_Y^r}{r} , \qquad \Phi(u) = \alpha \frac{|u|_Z^s}{s} \tag{18} \]

with the usual extension $\Phi(u) = \infty$ whenever $u \in X \setminus Z$.
The aim of this section is to analyze (17) with respect to the convergence
of the forward-backward splitting algorithm applied to (18).

First of all, we focus on the minimization problem and specify which 
semi-norms we want to allow as regularization functionals. Also, we need a 
condition on the linear operator K. 

Condition 1. 

1. Let $|\cdot|_Z$ be a semi-norm on the Banach space $Z$ such that
$Z = Z_0 \oplus Z_1$ with $Z_0 = \{z \in Z : |z|_Z = 0\}$ closed in $X$ and
$|\cdot|_Z$ being equivalent to $\|\cdot\|_Z$ on $Z_1$.

2. Let $K : X \to Y$ be a linear and continuous operator such that $K$,
restricted to $Z_0 \subset X$, is continuously invertible.

Remark 4. The first condition in Condition 1 is satisfied for many
semi-norms used in practice. For example, consider $X = L^t(\Omega)$ and
$Z = H^{1,t}(\Omega)$ with $|z|_Z = \|\nabla z\|_t$. Assume that $\Omega$ is
a bounded domain, then $Z_0 = \operatorname{span}\{\chi_\Omega\}$ is closed
in $X$ and $Z_1 = \{z \in H^{1,t}(\Omega) : \int_\Omega z \, dx = 0\}$, for
instance, gives $Z = Z_0 \oplus Z_1$. Provided that the Poincaré-Wirtinger
inequality holds (with constant $C$), we have for $z \in Z_1$

\[ \|z\|_t + \|\nabla z\|_t \le (C + 1) \|\nabla z\|_t \le (C + 1) \bigl( \|z\|_t + \|\nabla z\|_t \bigr) \]

meaning that $|\cdot|_Z$ and $\|\cdot\|_Z$ are indeed equivalent on $Z_1$.

Moreover, note that by considering $X/(\ker(K) \cap Z_0)$ and
$Z/(\ker(K) \cap Z_0)$ instead of $X$ and $Z$, respectively, the second
condition in Condition 1 is satisfied whenever the range of $K$ restricted
to $Z_0$ is closed in $Y$ (by the open mapping theorem). This is in
particular the case when $Z_0$ is finite-dimensional.

Let us briefly establish the existence of minimizers under the above
conditions.

Proposition 6. Under the assumption that Condition 1 is satisfied, the
minimization problem (17) possesses at least one solution in $X$.



Proof. First of all, note that since $Z = Z_0 \oplus Z_1$ with $Z_0$ and
$Z_1$ closed, there is, due to the closed-graph theorem, a continuous
projection $P : Z \to Z$ such that $\operatorname{rg}(P) = Z_1$ and
$\ker(P) = Z_0$. With this projection, $\|P \cdot\|_Z$ is equivalent to
$|\cdot|_Z$ (see also [22] for a similar situation).

Verify in the following that $F + \Phi$ is coercive in $X$. Suppose that
$\Phi(u^n) \le C_1$ for some sequence $\{u^n\} \subset X$. In particular,
$\{u^n\} \subset Z$ by the definition of $\Phi$ and
$\|Pu^n\|_X \le C_2 \|Pu^n\|_Z \le C_3 |u^n|_Z \le C_4$, meaning that $Pu^n$
makes sense and is bounded in $X$ whenever $\Phi(u^n)$ is bounded. Likewise,
examining $F$, we get with $Q = I - P : Z \to Z_0$ and Taylor expansion that

\[
\begin{aligned}
F(u) = \frac{\|Ku - f\|_Y^r}{r}
 &\ge \frac{1}{r} \bigl| \|KPu - f\|_Y - \|KQu\|_Y \bigr|^r \\
 &\ge \frac{\|KPu - f\|_Y^r}{r} - \|KPu - f\|_Y^{r-1} \|KQu\|_Y \\
 &\quad + \frac{r - 1}{2} \max \{ \|KPu - f\|_Y, \|KQu\|_Y \}^{r-2} \|KQu\|_Y^2 .
\end{aligned}
\]

Now suppose there is a sequence $\{u^n\}$ where $F + \Phi$ is bounded. The
claim is that $\|KQu^n\|_Y$ is also bounded. Assume the opposite, i.e. that
$\|KQu^n\|_Y \to \infty$. From the above argumentation we have
$\|Pu^n\|_X \le C_4$, hence $\|KPu^n - f\|_Y \le C_5$. Consequently, we can
assume, without loss of generality, that
$\|KQu^n\|_Y \ge \|KPu^n - f\|_Y$. The estimate on $F$ then reads as

\[ F(u^n) \ge -C_5^{r-1} \|KQu^n\|_Y + \frac{r - 1}{2} \|KQu^n\|_Y^r \]

implying that $F(u^n) \to \infty$, which is a contradiction.

Consequently, there holds $\|KQu^n\|_Y \le C_6$. By Condition 1, $K$ is
continuously invertible on $Z_0 \subset X$, so finally

\[ \|u^n\|_X \le \|Qu^n\|_X + \|Pu^n\|_X \le C_7 \|KQu^n\|_Y + C_4 \le C_8 \]

showing that $F + \Phi$ is coercive.

Consequently, $F + \Phi$ is proper, convex, lower semi-continuous and
coercive on the reflexive Banach space $X$, so at least one minimizer
exists. □

Next, we examine the problem and some properties of the functionals in (17)
more closely. Both $F$ and $\Phi$ are convex, proper and lower
semi-continuous. The differentiability of $F$, however, is strongly
connected with the differentiability of the norm in $Y$, since $K$ is
arbitrarily smooth. Since the forward-backward splitting algorithm also
demands some local Hölder-continuity for $F'$, we have to impose conditions.
The following notion is known to be directly connected with the
differentiability of the norm and the continuity of the derivatives.

Definition 2. A Banach space $Y$ is called smooth of power-type $p$ (or
$p$-smooth) with $p \in \, ]1, 2]$, if there exists a constant $C > 0$ such
that for all $u, v \in Y$ it holds that

\[ \frac{\|v\|_Y^p}{p} - \frac{\|u\|_Y^p}{p} - \langle j_p(u), v - u \rangle \le C \|v - u\|_Y^p . \]

Here, $j_p = \partial \|\cdot\|_Y^p/p$ denotes the $p$-duality mapping
between $Y$ and $Y^*$.

Now the central result which connects $p$-smoothness with differentiability
of the norm is the following (see [33]):

Proposition 7. If $Y$ is smooth of power-type $p$, then $\|\cdot\|_Y^p/p$ is
continuously differentiable with derivative $j_p$ which is moreover
$(p-1)$ Hölder-continuous. Furthermore, for $r \ge p$, the functional
$\|\cdot\|_Y^r/r$ is continuously differentiable. Its derivative is given by
$j_r$ which is still $(p-1)$ Hölder-continuous on each bounded subset of
$Y$.

Assuming $r \ge p$ and that $Y$ is $p$-smooth gives us, together with the
usual differentiation rules, that $F$ is differentiable with derivative

\[ F'(u) = K^* j_r(Ku - f) . \]

Since $K$ is continuous and $j_r$ is Hölder continuous of order $p - 1$ on
bounded sets by Proposition 7, and keeping in mind that $\|j_r\|_{p-1}$
might vary on bounded sets, we can estimate the norm $\|F'\|_{p-1}$ on
bounded sets as follows:

\[ \|F'\|_{p-1} \le \|K^*\| \, \|j_r\|_{p-1} \|K\|^{p-1} \le \|j_r\|_{p-1} \|K\|^p . \]

Hence, $F'$ possesses the Hölder-continuity required by the convergence
results for the forward-backward splitting algorithm.
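
The derivative formula $F'(u) = K^* j_r(Ku - f)$ can be illustrated in a
finite-dimensional setting. The sketch below assumes $Y = \mathbb{R}^m$
equipped with the $r$-norm, for which
$j_r(y)_i = |y_i|^{r-1} \operatorname{sgn}(y_i)$, and a matrix $K$ whose
adjoint is the transpose. The function names and the test data are
hypothetical; a central finite difference serves as an independent check.

```python
def mat_vec(A, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def duality_map(y, r):
    """j_r(y) for Y = R^m with the r-norm, i.e. the gradient of |y|_r^r / r."""
    return [abs(yi) ** (r - 1.0) * (1.0 if yi >= 0 else -1.0) for yi in y]


def F(K, f, r, u):
    """Discrepancy F(u) = |K u - f|_r^r / r."""
    return sum(abs(a - b) ** r for a, b in zip(mat_vec(K, u), f)) / r


def grad_F(K, f, r, u):
    """F'(u) = K^* j_r(K u - f), with K^* realized as the transposed matrix."""
    residual = [a - b for a, b in zip(mat_vec(K, u), f)]
    K_adjoint = [list(col) for col in zip(*K)]
    return mat_vec(K_adjoint, duality_map(residual, r))


# Hypothetical test data, chosen so that no residual component vanishes.
K = [[1.0, 0.5, 0.0], [0.0, 1.0, 0.5]]
f = [1.0, -0.5]
u = [0.3, -0.2, 0.4]
r = 1.5
gradient = grad_F(K, f, r, u)

# Central finite differences as an independent check of the formula.
h = 1e-6
numeric = []
for i in range(len(u)):
    up = list(u)
    up[i] += h
    um = list(u)
    um[i] -= h
    numeric.append((F(K, f, r, up) - F(K, f, r, um)) / (2.0 * h))
```

For other choices of $Y$, only `duality_map` would have to be replaced.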

As already noted in Theorems 1 and 2, convexity of the penalty can lead
to convergence with some rate. Here, we also want to extend the results to
a certain notion of convexity for semi-norms, which we define as follows:

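Definition 3. Let $|\cdot|_Z$ be a semi-norm according to Condition 1. Then,
the functional $\Phi = \alpha |\cdot|_Z^s/s$ is called totally convex, if,
for each fixed $z^* \in Z$, each sequence $\{z^n\}$ in $Z$ and
$\zeta \in \partial\Phi(z^*)$ there holds

\[ \alpha \frac{|z^n|_Z^s}{s} - \alpha \frac{|z^*|_Z^s}{s} - \langle \zeta, z^n - z^* \rangle \to 0 \quad \Rightarrow \quad |z^n - z^*|_Z \to 0 . \]

Likewise, $\Phi = \alpha |\cdot|_Z^s/s$ is called convex of power-type $q$
in $z^*$ if, for each $M > 0$ and $\zeta \in \partial\Phi(z^*)$ there exists
a $c > 0$ such that for all $\|z - z^*\|_Z \le M$ the estimate

\[ \alpha \frac{|z|_Z^s}{s} - \alpha \frac{|z^*|_Z^s}{s} - \langle \zeta, z - z^* \rangle \ge c |z - z^*|_Z^q \]

is satisfied.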



Remark 5. Analogously to Proposition 7, one knows that if
$\alpha |\cdot|_Z^q/q$ is convex of power-type $q$ on bounded sets for some
$q \ge 2$, then the functional $\Phi = \alpha |\cdot|_Z^s/s$ is also convex
of power-type $q$ for all $1 < s \le q$ on bounded sets. The same holds true
for the convexity of power-type for norms according to Definition 1.

With this notion, one is able to give convergence statements and rates
for the algorithm applied to the minimization problem (17).

Theorem 3. If, in the situation of Proposition 6, the space $Y$ is smooth of
power-type $p$ for $p \le r$ and the functional
$\Phi = \alpha |\cdot|_Z^s/s$ as well as the norm in $Y$ is totally convex,
then $\{u^n\}$ converges to the unique minimizer $u^*$. If, moreover,
$|\cdot|_Z^q/q$ as well as $Y$ is convex of power-type $q$ for
$q \ge \max\{r, s, 2\}$ in a minimizer $u^*$, then $u^n \to u^*$ in $X$ with
rate $O(n^{(1-p)/q})$.

Proof. First verify that the sequence $\{u^n\}$ is a minimizing sequence for
which the associated $r_n$ vanish like $n^{1-p}$, which means verifying the
prerequisites of Proposition 4. Both $F$ and $\Phi$ are proper, convex and
lower semi-continuous in the reflexive Banach space $X$, $F'$ is
$(p-1)$ Hölder-continuous on each bounded set by Proposition 7 and we
already saw in Proposition 6 that $F + \Phi$ is coercive. Hence,
Proposition 4 is applicable. It remains to show the asserted strong
convergence. As already mentioned at the end of Section 3, the
Bregman-distance is no longer sufficient in order to show convergence, so we
have to use Taylor-Bregman-distance estimates.

For that purpose, consider the Bregman-like distance $R$ according to (14)
and the Taylor-distance $T$ introduced in (15) (both with respect to $F$ and
$\Phi$ as chosen in (18) and in the minimizer $u^*$). Remember that we can
split the functional distance $r_n$ according to (12) into Bregman and
Taylor parts (16), i.e. $r_n = R(u^n) + T(u^n)$.

First suppose that $|\cdot|_Z$ is totally convex, meaning that from

\[ r_n \ge R(u^n) = \alpha \frac{|u^n|_Z^s}{s} - \alpha \frac{|u^*|_Z^s}{s} + \langle F'(u^*), u^n - u^* \rangle \]

and $r_n \le C_1 n^{1-p}$ (see Proposition 4) it follows that
$|u^n - u^*|_Z \to 0$. On the other hand, observe analogously that

\[ r_n \ge T(u^n) = \frac{\|Ku^n - f\|_Y^r}{r} - \frac{\|Ku^* - f\|_Y^r}{r} - \langle j_r(Ku^* - f), (Ku^n - f) - (Ku^* - f) \rangle \]

so the total convexity of $\|\cdot\|_Y$ implies
$\|K(u^n - u^*)\|_Y \to 0$.

Note that since $\{u^n\}$ is a minimizing sequence, we can reuse the
arguments from Proposition 6 as well as the projections $P$ and $Q$ to
obtain that $\{u^n\} \subset Z$ and $\|P(u^n - u^*)\|_Z \le C_3$. Thus, we
consider $u \in Z$ in the following and are eventually able to set
$u = u^n - u^*$. It holds that $\operatorname{rg}(Q) = Z_0$, hence
exploiting again the second part of Condition 1 gives

\[ \|KQu\|_Y^q \ge c_1 \|Qu\|_X^q . \]

Using the integral form of the Taylor expansion of $\frac{1}{q} |\cdot|^q$
up to order 2 yields, for $\varepsilon > 0$ small enough,

\[
\begin{aligned}
\frac{\|Ku\|_Y^q}{q}
 &\ge \frac{1}{q} \bigl| \|KQu\|_Y - \|{-KPu}\|_Y \bigr|^q
  \ge \frac{\|KQu\|_Y^q}{q} - \|KQu\|_Y^{q-1} \|KPu\|_Y \\
 &\ge \Bigl( \frac{1}{q} - \frac{\varepsilon^{q'}}{q'} \Bigr) \|KQu\|_Y^q - \frac{\varepsilon^{-q}}{q} \|KPu\|_Y^q
\end{aligned}
\]

where, for the latter, the inequality
$a^{q-1} b \le (q')^{-1} \varepsilon^{q'} a^q + q^{-1} \varepsilon^{-q} b^q$
for $a, b \ge 0$ has been utilized.

Together with the estimate on $\|KQu\|_Y$ and the continuity of $K$, one
gets

\[ \|Qu\|_X^q \le c_1^{-1} \|KQu\|_Y^q \le C_5 \bigl( \|KPu\|_Y^q + \|Ku\|_Y^q \bigr) \le C_6 \bigl( C_7 \|K\|^q |u|_Z^q + \|Ku\|_Y^q \bigr) \]

and consequently

\[
\begin{aligned}
\|u^n - u^*\|_X^q
 &\le 2^{q-1} \bigl( \|P(u^n - u^*)\|_X^q + \|Q(u^n - u^*)\|_X^q \bigr) \\
 &\le C_8 \bigl( |u^n - u^*|_Z^q + \|K(u^n - u^*)\|_Y^q \bigr) .
\end{aligned}
\tag{19}
\]

Since $R(u^n) \to 0$ and $T(u^n) \to 0$ imply $|u^n - u^*|_Z \to 0$ as well
as $\|K(u^n - u^*)\|_Y \to 0$, respectively, we have convergence
$u^n \to u^*$ in $X$.

Regarding the uniqueness, assume that $u^{**}$ is also a minimizer, hence
$R(u^{**}) = T(u^{**}) = 0$ and $u^{**} \in Z$. The total convexity then
yields $|u^{**} - u^*|_Z = 0 \Rightarrow \|P(u^{**} - u^*)\|_Z = 0$ as well
as $\|K(u^{**} - u^*)\|_Y = 0$. From the latter follows
$\|Q(u^{**} - u^*)\|_Z = 0$ and consequently the uniqueness statement
$\|u^{**} - u^*\|_Z = 0$.

In case | • |^/g is g-convex, we can write, having Remark [5] in mind, 

|'u"-n*||<c^i(a^^-a^^ + (F'(u*), u'^-n*)) = CgR{u'') . (20) 

where $c_2$ depends on the $X$-norm bound of the sequence $\{u^n\}$. On the other
hand, if $Y$ is $q$-convex, the Taylor-distance can be estimated as follows:

    $T(u^n) = \frac{\|Ku^n - f\|_Y^r}{r} - \frac{\|Ku^* - f\|_Y^r}{r} - \langle j_r(Ku^* - f), (Ku^n - f) - (Ku^* - f) \rangle$
    $\ge c_3 \|K(u^n - u^*)\|_Y^q$ .    (21)

Hence, (19) together with (20) and (21) becomes, because of ([1^ ,

    $\|u^n - u^*\|_X^q \le C_8(C_9 R(u^n) + c_3^{-1} T(u^n)) \le C_{10} r_n$ .

But $r_n \le C_1 n^{1-p}$, hence $u^n \to u^*$ with rate $O(n^{(1-p)/q})$. □
5 Examples and Applications

This section demonstrates some applications for the iterative minimization 
procedure discussed in this paper. 

We start with an example which is of rather general nature. It shows that
the forward-backward splitting procedure amounts to an iterative
thresholding-like algorithm when applied to linear inverse problems with
sparsity constraints. Afterwards, numerical computations showing the
performance of this algorithm in the discrete case are presented.

Example 1. Consider a linear inverse problem with sparsity constraints in
Banach spaces. The setting can be summarized as follows: We are given a
linear and continuous operator $K : \ell^r \to Y$ where $r > 1$, $Y$ is a $p$-smooth
and $q$-convex Banach space with $p \le r \le q$, $q \ge 2$ and some data $f \in Y$ and
want to solve the problem of minimizing the Tikhonov functional

    $\min_{u \in \ell^r} \frac{\|Ku - f\|_Y^r}{r} + \sum_{k=1}^{\infty} \alpha_k \frac{|u_k|^s}{s}$    (22)

where $\{\alpha_k\}$ is a sequence bounded by $0 < \underline{\alpha} \le \alpha_k \le \overline{\alpha} < \infty$ and $1 \le s \le r$.
Such situations occur, for example, when one tries to solve $Au = f$ for
some linear and continuous $A : X \to Y$, $X$ being an $L^r(\mathbb{R}^d)$ or a Besov
space $B^{\sigma}_{r,r}(\mathbb{R}^d)$ which is equivalently described using a properly rescaled
basis synthesis operator $B : \ell^r \to X$ with respect to appropriate scaling
functions/wavelets [2^. Utilizing the basis-expansion coefficients for
regularization (giving the $s$-th power of a norm which is equivalent to some
$B^{\sigma'}_{s,s}(\mathbb{R}^d)$) and denoting $K = AB$ then leads to Tikhonov functionals of the
type (22), see [ij] for details.

What does the associated forward-backward splitting algorithm look like?
First of all, we assume that the duality map $j_r$ in $Y$ can be computed. This
is for example the case for $Y = L^{p^*}(\mathbb{R}^d)$:

    $j_r(u) = \mathrm{sgn}(u)|u|^{p^*-1} \|u\|_{p^*}^{r-p^*}$ .

Note that $L^{p^*}(\mathbb{R}^d)$ is also smooth of power-type $p \le \min\{2, p^*\}$ and one can,
without greater effort, compute estimates for the Hölder-constants $\|j_r\|_{p-1}$
on bounded sets provided that $p \le r$. Consequently, it is reasonable to
assume that $F'(u) = K^* j_r(Ku - f)$ is computationally accessible as well as
the constants needed for ^.
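As an illustration of how the duality map can be evaluated, the following minimal sketch implements the formula for $j_r$ on a finite vector, which serves as a stand-in for a discretized $L^{p^*}$ function. The function names `p_norm` and `duality_map` and the sample values are assumptions made for this example and do not appear in the paper. The sketch checks the two defining properties of a duality map with gauge $t \mapsto t^{r-1}$: the pairing $\langle j_r(u), u \rangle$ equals $\|u\|^r$ and the dual norm of $j_r(u)$ equals $\|u\|^{r-1}$.

```python
def p_norm(u, p):
    """Finite-dimensional p-norm, used here as a stand-in for the L^{p*} norm."""
    return sum(abs(x) ** p for x in u) ** (1.0 / p)


def duality_map(u, r, p_star):
    """Evaluate j_r(u) = sgn(u) |u|^(p*-1) ||u||^(r-p*) componentwise."""
    norm = p_norm(u, p_star)
    if norm == 0.0:
        return [0.0 for _ in u]
    scale = norm ** (r - p_star)
    result = []
    for x in u:
        sign = 1.0 if x > 0 else -1.0 if x < 0 else 0.0
        result.append(sign * abs(x) ** (p_star - 1.0) * scale)
    return result


# Hypothetical sample data for the check.
u = [1.0, -2.0, 0.5]
r, p_star = 1.8, 1.5
j = duality_map(u, r, p_star)

# Pairing <j_r(u), u> and the dual norm of j_r(u) (exponent p*/(p*-1)).
pairing = sum(a * b for a, b in zip(j, u))
dual_norm = p_norm(j, p_star / (p_star - 1.0))
```

With a linear map given as a matrix, $F'(u) = K^* j_r(Ku - f)$ then amounts to one matrix-vector product, one call to `duality_map` and one product with the transpose.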

The main difficulty is therefore computing solutions of ^ which reads
as, denoting $w^n = K^* j_r(Ku^n - f)$,

oo II ri wP—v I ri \i" lis 

Erllv — U \\r l^k — Uh\ ( r, \Vk\ \ , N 

p r \ s J 

k=i ^ 
[Figure 1: two plots of $S_{y,t}(x)$ against $x$, with parameters $y = 0.75$, $t = 1.5$, $r = 1.5$, $s = 1.3$ and $y = 0.75$, $t = 1.25$, $r = 1.3$, $s = 1$.]

Figure 1: Illustration of the thresholding-like functions according to (24) for
some parameters.




It is easy to see that the only coupling between different k is via the factor 
z = -\\v — u^Wr'^ (if p 7^ r), so one can take the minimizers of 

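    $\varphi_{y,\sigma,t}(x) = \frac{|x - y|^r}{r} + \sigma x + t \frac{|x|^s}{s}$

as ansatz with $\sigma = z\tau_n w_k^n$ and $t = z\tau_n \alpha_k$. Since $\sigma$ enters only linearly in
$\varphi_{y,\sigma,t}$, it is convenient to write $\mathrm{argmin}_{x \in \mathbb{R}} \varphi_{y,\sigma,t}(x) = (\partial\psi_{y,t})^{-1}(-\sigma)$ with
$\psi_{y,t}(x) = \frac{|x - y|^r}{r} + t\frac{|x|^s}{s}$. The latter can be expressed as $(\partial\psi_{y,t})^{-1} = S_{y,t}$,

    $S_{y,t}(x) = \begin{cases} \bigl(\mathrm{sgn}(\cdot - y)|\cdot - y|^{r-1} + t\,\mathrm{sgn}(\cdot)|\cdot|^{s-1}\bigr)^{-1}(x) & \text{for } s > 1 \\ y + \mathrm{sgn}(T_{y,t}(x))|T_{y,t}(x)|^{1/(r-1)} & \text{for } s = 1 \end{cases}$    (24)

and

    $T_{y,t}(x) = \begin{cases} x + t & \text{for } x < -\mathrm{sgn}(y)|y|^{r-1} - t \\ x - t & \text{for } x > -\mathrm{sgn}(y)|y|^{r-1} + t \\ -\mathrm{sgn}(y)|y|^{r-1} & \text{else.} \end{cases}$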

It is notable that for $s = 1$, each $S_{y,t}$ is a thresholding-like function
(see Figure [1]). For $r = 2$, one can easily verify the identity
$S_{y,t}(x) = \mathrm{sgn}(x + y)\bigl[|x + y| - t\bigr]_+$, meaning that $S_{y,t}$ is the well-known
soft-thresholding. In order to get a solution for (23), one introduces the operator
which applies $S_{y,t}$ pointwise, i.e. $S_{u,t}(w)_k = S_{u_k,t_k}(w_k)$ such that optimality
is achieved if and only if



Sn" 



,a{-ZTnW 



z = z V 



n \\r—p 



For r = p follows z = 1, so one can express the solution explicitly, for the 
other cases, it is necessary to compute the value for z numerically. 
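To make the case $s = 1$ concrete, the following sketch evaluates the thresholding-like function from (24) for scalars and compares it with soft-thresholding for $r = 2$. The function names and the sample values are assumptions made for this illustration. On the plateau, where $T_{y,t}$ equals $-\mathrm{sgn}(y)|y|^{r-1}$, the formula gives $y - y = 0$, so the sketch returns an exact zero there rather than a rounded one.

```python
def sign(x):
    """Sign of a real number as -1, 0 or 1."""
    return (x > 0) - (x < 0)


def threshold_S(x, y, t, r):
    """S_{y,t}(x) for s = 1: inverse of v -> sgn(v-y)|v-y|^(r-1) + t*sgn(v)."""
    center = -sign(y) * abs(y) ** (r - 1.0)
    if abs(x - center) <= t:
        # Plateau of width 2t: the minimizer is exactly zero.
        return 0.0
    tau = x + t if x < center else x - t
    return y + sign(tau) * abs(tau) ** (1.0 / (r - 1.0))


def soft_threshold(x, t):
    """Classical soft-thresholding sgn(x) * max(|x| - t, 0)."""
    return sign(x) * max(abs(x) - t, 0.0)


# Hypothetical sample point: r = 1.5, y = 0.75, t = 1.25, argument x = 3.
v = threshold_S(3.0, 0.75, 1.25, 1.5)
# v solves sgn(v - y)|v - y|^(r-1) + t*sgn(v) = x; the residual should vanish.
residual = sign(v - 0.75) * abs(v - 0.75) ** 0.5 + 1.25 * sign(v) - 3.0
```

For $r = 2$ the exponent $1/(r-1)$ equals one, and `threshold_S(x, y, t, 2.0)` coincides with `soft_threshold(x + y, t)`.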

To introduce some simplification, note that for $u^0 = 0$ follows $\|u\|_r^s \le
\|u\|_s^s \le s\underline{\alpha}^{-1}\Phi(u) \le s\underline{\alpha}^{-1}(F + \Phi)(u^0)$ yielding $\|u\|_r \le (s\|f\|_Y^r/(\underline{\alpha} r))^{1/s}$. The
forward-backward splitting algorithm for the solution of (22) then reads as
follows.

1. Initialize with $n = 0$, $u^0 = 0$ and choose $\|F'\|_{p-1}$ locally for $\|u\|_r \le
(s\|f\|_Y^r/(\underline{\alpha} r))^{1/s}$.

2. Compute the value

    $w^n = K^* j_r(Ku^n - f)$

and solve, for a $\tau_n$ satisfying (2), the scalar equation

n+l _ C „ (-_-; ^ ,,,»^^ 2 _ P||,,"+l _7,"|r-P 

3. Take, if optimality is not reached, $u^{n+1}$ as the next iterate and continue
with Step 2 and $n := n + 1$.
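The steps above can be sketched for the simplest special case $p = r = 2$, $s = 1$ with $Y$ a finite-dimensional Euclidean space. There $j_2$ is the identity, $z = 1$, and the update reduces to classical iterative soft-thresholding. This is only an illustration under these assumptions: the helper names, the test problem and the step size $\tau = 1/\|K\|^2$ (a standard sufficient choice in the Hilbert-space case, not the paper's step-size condition) are not taken from the paper.

```python
def matvec(K, u):
    """Compute K u for a matrix given as a list of rows."""
    return [sum(k_ij * u_j for k_ij, u_j in zip(row, u)) for row in K]


def rmatvec(K, v):
    """Compute the adjoint product K^T v."""
    return [sum(K[i][j] * v[i] for i in range(len(K))) for j in range(len(K[0]))]


def soft_threshold(x, t):
    """Soft-thresholding of a scalar x with threshold t >= 0."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def objective(K, f, alpha, u):
    """Tikhonov functional ||Ku - f||^2 / 2 + sum_k alpha_k |u_k|."""
    residual = [a - b for a, b in zip(matvec(K, u), f)]
    return 0.5 * sum(x * x for x in residual) + sum(a * abs(x) for a, x in zip(alpha, u))


def forward_backward(K, f, alpha, tau, n_iter):
    """Iterate u <- S(u - tau * K^T(Ku - f)) starting from u^0 = 0."""
    u = [0.0] * len(K[0])
    history = [objective(K, f, alpha, u)]
    for _ in range(n_iter):
        residual = [a - b for a, b in zip(matvec(K, u), f)]
        w = rmatvec(K, residual)  # forward step: w^n = K^*(K u^n - f)
        u = [soft_threshold(u_k - tau * w_k, tau * a_k)  # backward step
             for u_k, w_k, a_k in zip(u, w, alpha)]
        history.append(objective(K, f, alpha, u))
    return u, history


# Hypothetical diagonal test problem with known minimizer (2, 0).
K = [[1.0, 0.0], [0.0, 2.0]]
f = [3.0, 0.2]
alpha = [1.0, 1.0]
u, history = forward_backward(K, f, alpha, tau=0.25, n_iter=200)
```

In this toy problem the second component is thresholded to exactly zero in every iteration, which mirrors the sparsity of the iterates for $s = 1$. For $p \neq r$ the additional scalar $z$ would have to be determined numerically in each step, which this sketch does not attempt.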

It is notable that for $s = 1$, the iterates are always sparse, as a
consequence of the optimality condition for the auxiliary problem (23) and the
fact that sequences in $\ell^r$ are always null-sequences. Regarding the
convergence of the algorithm, one easily verifies that Theorem [3] is applicable
for $s > 1$ since $\Phi$ is the $s$-th power of an equivalent norm on $\ell^s$ which is
moreover $q$-convex and $Y$ is, by assumption, $q$-convex (although this is
actually not needed). It follows that in that case we have the convergence rate
$O(n^{(1-p)/q})$.

For the case $s = 1$, $\Phi$ is not even strictly convex and things are a little
bit more complicated. But basically, one only has to apply the tools already
introduced in a slightly different manner. Also, the following condition turns
out to be crucial for the argumentation: Let us assume that $K$ possesses the
finite basis injectivity property, that is, for every finite index set $J \subset \mathbb{N}$
the restriction $K|_J$ is injective (meaning that from $Ku = 0$ and $u_k = 0$ for
each $k \notin J$ follows $u = 0$). Under such an assumption, it is also possible
to obtain a convergence rate of $O(n^{(1-p)/q})$ where $q$ is the convexity of the
space $Y$.

This can be seen analogously to the proof of Theorem [3] and also follows
a similar line of argumentation in [5]. Observe that for an optimal $u^*$ it
holds that $-F'(u^*) = -K^* j_r(Ku^* - f) \in \partial\Phi(u^*)$ or, somehow weaker,

    $|K^* j_r(Ku^* - f)|_k \le \alpha_k$  for $u_k^* = 0$ ,
    $|K^* j_r(Ku^* - f)|_k = \alpha_k$  for $u_k^* \ne 0$ .

Let $u^*$ be a minimizer and denote by $J = \{k \in \mathbb{N} : |K^* j_r(Ku^* - f)|_k = \alpha_k\}$
which has to be a finite set since otherwise $K^* j_r(Ku^* - f) \notin \ell^{r'}$ (remember
the assumption $\alpha_k \ge \underline{\alpha} > 0$). Likewise, there exists a $\rho < 1$ such that
$|K^* j_r(Ku^* - f)|_k \le \alpha_k \rho$ for all $k \notin J$. For convenience, denote by $P$ the
continuous projection $(Pu)_k = u_k \chi_{\mathbb{N} \setminus J}(k)$, by $Q = I - P$ and define the
semi-norms $|z|_1 = \|Pz\|_1$ in the space $Z = \ell^1$ as well as $|z|_r = \|Pz\|_r$ in $\ell^r$.

We derive an estimate which somehow states the $q$-convexity of $\Phi$ with
respect to the semi-norm $|\cdot|_r$. Observe that for $k \notin J$, there has to be
$u_k^* = 0$, hence one can estimate, for all $\|v\|_r \le M$,

    $R(v) \ge \sum_{k \notin J} \alpha_k(|v_k| - |u_k^*|) + F'(u^*)_k(v_k - u_k^*) \ge \sum_{k \notin J} \alpha_k|v_k| - |F'(u^*)_k||v_k|$
    $\ge (1 - \rho)\sum_{k \notin J} \alpha_k|v_k| \ge (1 - \rho)\underline{\alpha}|v - u^*|_1$ .

Then one estimates $|v - u^*|_1 \ge |v - u^*|_r \ge M^{1-q}|v - u^*|_r^q$ which leads to

    $R(v) \ge c|v - u^*|_r^q$    (25)

for a $c > 0$ which only depends on $M$. On the other hand, observe that
a variant of Condition [1] is satisfied: $\ell^1 = Z = Z_0 \oplus Z_1$ with $Z_0 = \mathrm{rg}(Q)$
finite-dimensional and hence closed in $\ell^r$ and $|\cdot|_1$ being exactly the norm
on $Z_1 = \mathrm{rg}(P)$. By the finite basis injectivity property, $K$ is injective on
$\mathrm{rg}(Q)$ and since the latter is finite-dimensional, also continuously invertible.
But (25) and the latter is exactly what is utilized in the proof of Theorem [3]
to show the desired convergence rate. Hence, by repeating the arguments
there, one obtains $\|u^n - u^*\|_r = O(n^{(1-p)/q})$.

Example 2. We would like to present numerical computations for a variant of the
algorithm developed in Example [1]. The problem we consider is inverting
the integration operator on $[0, 1]$ in $L^p([0, 1]) \to L^p([0, 1])$ which is, for
simplicity, discretized (with a delta-peak basis) and penalized with the discrete
$L^1([0, 1])$-norm:

    $\min_{u \in L^p([0,1])} \frac{\|Ku - f\|_p^p}{p} + \alpha\|u\|_1$ ,  $Ku(t) = \int_0^t u(s)\,ds$ .    (26)

The forward-backward splitting algorithm applied to the discretized problem
then is just the iterative thresholding-like procedure computed in Example [1]
restricted to finitely many dimensions. Computations for $p = 1.5$ and $p = 2$
for noisy data have been performed, see Figure [2]. The regularization
parameter has been tuned in order to yield approximately the same discrepancy in
the respective norms. As one can see, choosing $p$ less than 2 may favor more
sparsity: Compared to $p = 1.5$, the solution for $p = 2$ has approximately
50% more non-zero elements.
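The text does not spell out the discretization, so the following is only one plausible reading: on a uniform grid with $n$ cells of width $h = 1/n$, a delta peak located in cell $j$ contributes $h$ to every later value of the integral, which gives a lower-triangular matrix. The function names and the grid size are assumptions made for this sketch.

```python
def integration_matrix(n):
    """Lower-triangular matrix with (K u)_i = h * sum_{j <= i} u_j, h = 1/n."""
    h = 1.0 / n
    return [[h if j <= i else 0.0 for j in range(n)] for i in range(n)]


def apply(K, u):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(k_ij * u_j for k_ij, u_j in zip(row, u)) for row in K]


n = 4
K = integration_matrix(n)

# Integrating the constant function 1 gives the ramp t at the grid points.
ramp = apply(K, [1.0] * n)

# A single peak in cell 1 produces a step that starts at that cell.
step = apply(K, [0.0, 1.0, 0.0, 0.0])
```

A sparse coefficient vector therefore corresponds to piecewise constant data, which matches the setting of recovering few peaks from their noisy integral.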

Furthermore, as predicted by the theory, the numerical algorithm indeed
converged with some rate. In practice, however, it turns out that the
convergence is somewhat stable on the one hand but very slow on the other hand,
and many iterations are needed to achieve accurate results.

The following example focuses on presenting an application in which it 
is natural to consider the Banach-space setting and on showing that the 
forward-backward splitting procedure leads to a convergent algorithm. 
[Figure 2: plots of the solutions and of their images under $K$ on $[0, 1]$, followed by the table below.]

    p      ||K u_p - f_δ||_p    #NZ    ||f - f_δ||_p    α
    1.5    0.004651             40     0.004565         0.0025
    2      0.004669             61     0.005069         0.000022

Figure 2: The outcome of the algorithm for the solution of (26). Top row:
The exact solution (left) and the exact data $f$ which is disturbed by noise
$f_\delta$ (right). Middle rows: The results of the thresholding-like algorithm $u_{1.5}$
and $u_2$ (left) as well as their images under $K$ (right) for $p = 1.5$ and $p = 2$,
respectively. Bottom row: The discrepancy (in the $p$-norm), the number of
non-zero elements in the solution (out of 500, $u$ has 9), the data error as well
as the regularization parameter associated with $u_{1.5}$ and $u_2$, respectively.
Example 3. Consider the problem of restoring an image in higher dimensions
with a total-variation penalty term:

    $\min_{u \in L^p(\Omega)} \frac{\|Ku - f\|_p^p}{p} + \alpha\,\mathrm{TV}(u)$    (27)

with $\Omega \subset \mathbb{R}^d$ being a bounded domain such that $\mathrm{BV}(\Omega)$ is compactly
embedded in appropriate Lebesgue spaces. Here, $K$ denotes a linear and
continuous operator mapping $L^p(\Omega) \to L^p(\Omega)$ with $1 < p \le d/(d - 1)$. This covers
in particular the convolution with kernels which are only Radon measures,
i.e. $Ku = u * k$ with $k \in \mathcal{M}(\mathbb{R}^d)$, such that the usually assumed continuity
$L^p(\Omega) \to L^2(\Omega)$ does not necessarily hold [31], especially for $d \ge 3$.
Moreover, in general, coercivity in $L^2(\Omega)$ fails and it is necessary to consider the
Banach space setting. From [1] we know that (27) indeed admits a solution
in $L^p(\Omega)$ under general conditions.

In the following, we focus on the problem of restoring a blurred
three-dimensional image, i.e. $Ku = u * k$ for some non-negative point-spread
function $k$ with $\|k\|_1 = 1$ and $d = 3$. Such a task arises, for example,
in confocal microscopy [30]. In order to apply the algorithm, we have to
solve auxiliary problems of the type

    $\min_{v \in L^p(\Omega)} \frac{\|v - u\|_p^p}{p} + s(\langle w, v \rangle + \alpha\,\mathrm{TV}(v))$    (28)

with $u \in L^p(\Omega)$ and $w \in L^{p'}(\Omega)$ which resembles the total-variation
denoising functional. Following [20, 10], one approach to solve (28) is to consider
the Fenchel-predual problem which equivalently reads as

    $\min_{\|z\|_\infty \le s\alpha,\ z \cdot \nu = 0 \text{ on } \partial\Omega} \frac{\|\mathrm{div}\,z - sw\|_{p'}^{p'}}{p'} + \langle \mathrm{div}\,z, u \rangle$ .

In the discrete setting, with an appropriate linear discrete divergence
operator, the above becomes a smooth minimization problem with convex
constraints which can, for example, be solved approximately by a gradient
projection method [15] or a semi-smooth Newton method [27]. Finally,
once a solution $z^*$ of the predual problem is known, a solution of (28) can
be obtained by the corresponding Fenchel optimality conditions which, in
this case, lead to the identity $v^* = u + j_{p'}(\mathrm{div}\,z^* - sw)$.
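The last identity can be illustrated in a discrete setting. In the sketch below, the vector `g` stands for an already computed $\mathrm{div}\,z^* - sw$ (the predual solve itself is not shown), and $j_{p'}$ is taken as the componentwise map $g \mapsto \mathrm{sgn}(g)|g|^{p'-1}$, i.e. the derivative of $\|\cdot\|_{p'}^{p'}/p'$. The names `recover_primal` and `j_power` and the sample numbers are assumptions made for this example. The check confirms that $v^* = u + j_{p'}(g)$ satisfies the stationarity condition $\mathrm{sgn}(v - u)|v - u|^{p-1} = g$ of $\|v - u\|_p^p/p - \langle g, v \rangle$.

```python
def sign(x):
    """Sign of a real number as -1.0, 0.0 or 1.0."""
    return 1.0 if x > 0 else -1.0 if x < 0 else 0.0


def j_power(g, q):
    """Componentwise map g -> sgn(g) |g|^(q-1), the derivative of ||g||_q^q / q."""
    return [sign(x) * abs(x) ** (q - 1.0) for x in g]


def recover_primal(u, g, p):
    """Return v = u + j_{p'}(g) with the conjugate exponent p' = p / (p - 1)."""
    p_conj = p / (p - 1.0)
    return [u_i + j_i for u_i, j_i in zip(u, j_power(g, p_conj))]


# Hypothetical data: p = 1.4 lies in the range 1 < p < 3/2 considered for d = 3.
p = 1.4
u = [0.3, -1.2, 2.0]
g = [0.5, -0.8, 0.0]  # plays the role of div z* - s w
v = recover_primal(u, g, p)

# Stationarity residual sgn(v - u)|v - u|^(p-1) - g, componentwise.
residual = [sign(v_i - u_i) * abs(v_i - u_i) ** (p - 1.0) - g_i
            for v_i, u_i, g_i in zip(v, u, g)]
```

The residual vanishes up to rounding because $p' - 1 = 1/(p - 1)$, so raising to the power $p - 1$ undoes the map `j_power`.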

Hence, one can actually perform the forward-backward splitting procedure
in practice. Regarding its convergence, we are only able to apply
Proposition [5] and get weak convergence of $\{u^n\}$ to the minimizer $u^*$ in
$L^p(\Omega)$ (note that the minimizer has to be unique since $K$ is injective). But
additionally, one easily deduces that $\{\mathrm{TV}(u^n)\}$ is also bounded which gives,
by compactness, the strong convergence in case $1 < p < 3/2$. Consequently,
considering the inverse problem in Banach space yields a convergent
algorithm for regularization with the total-variation semi-norm. The
convergence, however, comes without an estimate for its speed.

Based on the arguments presented above, numerical computations have
been carried out. The outcome of the algorithm for some sample
data is shown in Figures [3] and [4].



6 Summary and Conclusions 

The aim of this paper was to show that there is a meaningful generalization
of the forward-backward splitting algorithm to general Banach spaces.
The main idea was to write the forward-backward step as a minimization
problem (3) and to generalize this problem to ^ which defines the
iteration. Convergence of this procedure was achieved by proving a descent
rate of $n^{1-p}$ of the functional distance on the one hand and utilizing notions of
convexity of the non-smooth functional $\Phi$ to establish norm-convergence of
rate $n^{(1-p)/q}$ on the other hand. This rate is, however, rather slow in comparison
to, e.g., linear convergence. But we have convergence nevertheless, and in order
to prove that the general procedure converges, it suffices to look at the
functional for which the backward-step is performed.

These abstract results were applied to the concrete setting of Tikhonov
functionals in Banach space. The forward-backward splitting algorithm
was applied to the computational minimization of functionals of Tikhonov-type
with semi-norm regularization and, using Bregman-Taylor-distance
estimates, convergence was proven provided the linear operator has a
continuous inverse on the space where the semi-norm vanishes. In particular,
convergence rates translated to the convexity of the regularizing semi-norm
as well as to the smoothness and convexity of the underlying data space, the
latter originating from the situation that semi-norms are in general invariant
on whole subspaces.

As the examples showed, the results are applicable for Tikhonov
functionals considered in practice. In particular, the algorithm can be used
to deal with sparsity constraints in Banach space and to derive a
convergent generalization of the popular iterative soft-thresholding procedure by
Daubechies, Defrise and De Mol to Banach spaces. The resulting algorithm
shares many properties with its Hilbert-space prototype: It is easy to
implement and produces sparse iterates, but it also converges very slowly, which
was to be expected since its prototype in Hilbert space also admits very slow
convergence. Thus, there is the need to accelerate the procedure by, for
example, utilizing better step-size rules or higher-order methods. Finally,
the method also works for image restoration problems of dimension three
which can, in general, not be solved in $L^2(\Omega)$ anymore. Although the theory
does not yield estimates for the convergence rate, we are still able to obtain
strong convergence in some $L^p(\Omega)$.

Figure 3: Numerical illustration of three-dimensional total-variation
regularization for image deblurring. On the left hand side, the 2D-slices of
the 3D-dataset are shown, while on the right hand side, some
isosurfaces of a cut of the data are depicted. The rows show, from top to
bottom, the original artificially created data $u$, which has been blurred and
disturbed with noise to form the data $f$, and the outcome of the iterative
forward-backward splitting algorithm $u^*$ for the deblurring problem with
total-variation penalization.

Figure 4: Reconstruction of noisy blurred three-dimensional microscopy data
showing cortical neurons in transgenic mice. Again, slices and isosurface
representations of the true image $u$, the noisy data $f$ and the solution of the
TV-regularization problem $u^*$ (from top to bottom) are depicted. Here, the
minimization algorithm is also able to remove the noise artifacts from the
data. However, a reduction of the contrast and some loss of detail can be
observed, as is typical for total-variation based regularization. (Dataset
from http://152.19.37.82/pages/datasample.php, see also [18]).